Trying to improve phone and word recognition using finely tuned phone-like units

نویسندگان

  • Kåre Sjölander
  • Jesper Högberg
چکیده

Phone-like units (PLUs) for automatic speech recognition are derived using a decision tree algorithm. In our approach we use information such as target phone label, immediate context, lexical stress level and function word affiliation in the decision tree analysis. The resulting PLUs are shown to improve phone and word recognition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effects of allophones on the performance of Korean speech recognition

This paper investigates the effects of allophones on the performance of Korean speech recognition systems. Along with a baseline phone-like unit (PLU) set consisting of phonemes, 31 allophone-based PLU sets are designed by systematically considering 5 major Korean allophonic constraints which can describe all the PLU sets currently used for Korean speech recognition systems. Experiments for pho...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Improving the Arabic Pronunciation Dictionary for Phone and Word Recognition with Linguistically-Based Pronunciation Rules

In this paper, we show that linguistically motivated pronunciation rules can improve phone and word recognition results for Modern Standard Arabic (MSA). Using these rules and the MADA morphological analysis and disambiguation tool, multiple pronunciations per word are automatically generated to build two pronunciation dictionaries; one for training and another for decoding. We demonstrate that...

متن کامل

Communication Behaviour of Farmers with the Agricultural Extension Agents Using Cell Phone: A Case of Bangladesh

The cell phone is one of the potential Information Communication Technologies (ICTs) in agricultural development especially in developing countries like Bangladesh. Thus, this paper deals with the farmers’ communication with the agricultural extension agents using mobile phone. The study was conducted in Mymensingh District in Bangladesh. Data were collected from a sample of 110 farmers while b...

متن کامل

Incorporating information from syllable-length time scales into automatic speech recognition

Including information distributed over intervals of syllabic duration (100–250 ms) may greatly improve the performance of automatic speech recognition (ASR) systems. ASR systems primarily use representations and recognition units covering phonetic durations (40–100 ms). Humans certainly use information at phonetic time scales, but results from psychoacoustics and psycholinguistics highlight the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007